RNA Sequence to Structure Mapping

نویسنده

  • Stephan Kopp
چکیده

Motivated by the observation, that RNA folding gives rise to extended neutral networks in sequence space, concepts of random graph theory are applied to build a model of RNA sequence to structure mappings. This model allows to investigate generic properties of sequence-structure relations as well as the effect of neutrality. The random mapping construction is based on two tunable parameters. These parameters pu and pp resemble the average degree of neutrality for unpaired and paired part of the RNA secondary structure, respectively. In the model a set of secondary structures must be given. The mapping is performed by building the preimage of the structures. For this purpose, the set of sequences C is constructed which are compatible with a given structure s. From this set, sequences are chosen with a probability determined by pu and pp, and finally assigned to the structure s, if (and only if) this sequence has not been mapped to another structure. The properties we are focusing on are the existence of extended neutral nets in sequence space, the connectivity of these nets and their denseness in C. The mathematical theory for our model claims the existence of a threshold value for connectivity and denseness properties of neutral nets. The theorems hold in the limit of infinite chain length and determine the threshold value to be p = 1− κ−1 √ 1/κ in both cases. Here, κ is the size of the alphabet used to encode the unpaired or paired parts of the sequences, respectively. Below this threshold the nets are neither connected not dense in C, whereas above the threshold almost all nets are connected and dense in C. Computer experiments indicate that a threshold exists also for finite chain length, although it is not sharp anymore. However, within the accuracy of the simulations the threshold value is identical with the theoretically predicted one. Furthermore, it is identical for both properties. We investigate the influence of the tertiary contacts on generic properties of sequence-structure mappings. Instead of trying to predict tertiary structures of sequences we determine the tertiary contacts. Compatible sequences are then constructed according to an arbitrary base-pairing rule. This model also contains a tunable parameter determining the frequency of tertiary contacts in a structure. We show that in this model large neutral networks exist for tertiary structures even in the case where the structures contain a relatively high number of tertiary contacts.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Relation Between RNA Sequences, Structures, and Shapes via Variation Networks

Background: RNA plays key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...

متن کامل

Comparative bioinformatics analysis of a wild diploid Gossypium with two cultivated allotetraploid species

Background: Gossypium thurberi is a wild diploid species that has been used to improve cultivated allotetraploid cotton. G. thurberi belongs to D genome, which is an important wild bio-source for the cotton breeding and genetic research. To a certain degree, chloroplast DNA sequence information are a versatile tool for species identification and phylogenetic implications in plants. Different ch...

متن کامل

Mapping Sequence diagram in Fuzzy UML to Fuzzy Petri Net

This ability in fuzzy UML, practically leaves the customers and market’s need without response in this important and vital area. Here, the available sequence diagrams in fuzzy UML will map into fuzzy Petri net. However, the formal models ability will be added to the Semi-formal fuzzy UML. This formalization will add the automatic processing ability to the Semi-formal fuzzy UML. Further more, t...

متن کامل

Modeling RNA Secondary Structure with Sequence Comparison and Experimental Mapping Data.

Secondary structure prediction is an important problem in RNA bioinformatics because knowledge of structure is critical to understanding the functions of RNA sequences. Significant improvements in prediction accuracy have recently been demonstrated though the incorporation of experimentally obtained structural information, for instance using selective 2'-hydroxyl acylation analyzed by primer ex...

متن کامل

Double Sequence Iterations for Strongly Contractive Mapping in Modular Space

In this paper, we consider double sequence iteration processes for strongly $rho$-contractive mapping in modular space. It is proved, these sequences, convergence strongly to a fixed point of the strongly $rho$-contractive mapping.

متن کامل

Identification of metallothionein gene structure in sterlet (Acipenser ruthenus)

Aquatic organisms present, not only simple sources of accumulated metal, but can interact with metals, altering their toxicity. Due to exposition of biosphere with metals, organisms have developed various defense mechanisms to protect themselves against adverse effects of these ions and their compounds. Metallothionein (MT) is one of that which represents a critical mechanism for detoxification...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998